Cluster Stability

نویسندگان

  • Zeev Barzily
  • Zeev Volkovich
  • Başak Akteke-Öztürk
  • Gerhard-Wilhelm Weber
چکیده

In this paper, a method for the study of cluster stability is purposed. We draw pairs of samples from the data, according to two sampling distributions. The first distribution corresponds to the high density zones of data-elements distribution. It is associated with the clusters cores. The second one, associated with the cluster margins, is related to the low density zones. The samples are clustered and the two obtained partitions are compared. The partitions are considered to be consistent if the obtained clusters are similar. The resemblance is measured by the total number of edges, in the clusters minimal spanning trees, connecting points from different samples. We use the Friedman and Rafsky two sample test statistic. Under the homogeneity hypothesis, this statistic is normally distributed. Thus, it can expected that the true number of clusters corresponds to the statistic empirical distribution which is the closest to normal. Numerical experiments demonstrate the ability of the approach to detect the true number of clusters.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Density Functional Study on Stability and Structural Properties of Cu n clusters

In this research DFT/B3LYP method has been employed to investigate the geometrical structures,relative stabilities, and electronic properties of Cun (n=3–10) clusters for clarifying the effect of sizeon the properties. Through a careful analysis of the successive binding energies, second-orderdifference of energy and the highest occupied-lowest unoccupied molecular orbital energy gaps as afunct...

متن کامل

Genetic Stability of Micropropagated Plantlets in Date Palm

Randomly amplified polymorphic DNA (RAPD) markers were used to analyze genetic stability of the somatic embryogenesis-derived regenerants (R1-6) and mother plant in Iranian date palm (Phoenix dactylifera L.) cultivar Khanizi. Total genomic DNA extracted from in vitro fresh leaves of regenerated plants and mother plant was amplified using 10-mer oligonucleotide Fermantas primers. Four primers of...

متن کامل

Node Based Cluster Routing Algorithm for Mobile Ad-Hoc Network

Mobility of node is an important subject at the time of clustering in mobile ad hoc network as it directly affects the strength of the cluster. MANETs clustering based algorithms commonly suffers with crash of cluster-head problem, which degrades the cluster stability. This paper proposes, Node Based Cluster Routing Algorithm (NBCRA), a schema to improve the cluster stability and in-turn to imp...

متن کامل

Node Cluster Stability in Vehicular Ad hoc Networks

In recent years, efforts have been made to deploy communication capabilities in vehicles and the transport infrastructure, leading to a potential of vehicular ad hoc networks (VANETs). In the envisioned VANET, communications among vehicles will enhance the intelligent transportation systems (ITS) and support not only public-safety applications, but also a wide range of infotainment applications...

متن کامل

Cluster Stability for Finite Samples

Over the past few years, the notion of stability in data clustering has received growing attention as a cluster validation criterion in a sample-based framework. However, recent work has shown that as the sample size increases, any clustering model will usually become asymptotically stable. This led to the conclusion that stability is lacking as a theoretical and practical tool. The discrepancy...

متن کامل

A Resampling Approach to Cluster Validation

The concept of cluster stability is introduced as a means for assessing the validity of data partitionings found by clustering algorithms. It allows us to explicitly quantify the quality of a clustering solution, without being dependent on external information. The principle of maximizing the cluster stability can be interpreted as choosing the most self-consistent data partitioning. We present...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008